SEQ 



GENERAL INFORMATION 



(i) 
(ii) 



APPLICANT: 
TITLE OF INVENTION 



NCE LISTING 



Darre 11 Anderson, Nabil Hanna, John Leonard, 
Rolan I Newman and Mitchell Reff 



THERAPEUTIC APPLICATION OF 
ICHIMERIC ANTIBODY TO HUMAN B 
(LYMPHOCYTE RESTRICTED 
(DIFFERENTIATION ANTIGEN FOR 
^REATMENT OF B CELL LYMPHOMA 



(iii) 
(iv) 



NUMBER OF SEQUENCES: 8 
CORRESPONDING ADDRESS: 



(A) ADDRESSEE: 

(B) STREET: 

(C) CITY: 

(D) STATE: 

(E) COUNTRY: 

(F) ZIP: 



IDEC Pharmaceuticals Corporation 
111099 N. Torrey Pines Road, #160 
ia Jolla 
lalifornia 
}SA 
92037 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: MS.DOS 

(D) SOFTWARE: Microsoft Word 5.0 

(vi ) CURRENT APPLICATION DATA: 



APPLICATION NUMBER: 
FILING DATE: 
CLASSIFICATION: 



(A) 
(B) 
(C) 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Burgoon, Richkrd P. Jr. 

(B) REGISTRATION NUMBER: 34,787 

(C) REFERENCE/DOCKE" NUMBER: 

ORMATION: 



(ix) TELECOMMUNICATION II 



(A) TELEPHONE: (619)45 

(B) TELEFAX: (619)546-95 



-0600 
74 
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0^ (2) INFORMATION FOR SEQ. ID.VNO4: 1: 

(i) SEQUENCE CHARACTERISTICS: 
5 \ 

(A) LENGTH: 8540 basfcs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes- no 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: ^EQ. ID. NO,: 1: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTC^TCA CTACTTCTGG AATAGCTCAG 60 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 12 0 

GGAGAATGGG CGGAACTGGG CGG&GTTAGG GGCGGGATfJG GCGGAGTTAG GGGCGGGACT 180 

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTT\C TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGfJ TTTGCATACT TCTGCCTGCT 300 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT\ TCCACAGAAT TAATTCCCCT 3 60 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA \GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC (fCAACGACCC CCGCCCATTG 480 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GlGACTTTCCA TTGACGTCAA 540 

TGGGTGG AC T ATTTACGGTA AACTGCCCAC TTGGCAGTAC A^CAAGTGTA TCATATGCCA 600 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCfGGCATTA TGCCGAGTAC 660 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TAT^TAGTCAT CGCTATTACC 72 0 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCOGTTTGA CTCACGGGGA 7 80 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTG^CACCA AAATCAACGG 840 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATdpGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGi^TCGC CTGGAGACGC 960 

CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCf CCT GCTGCTCTGG 1020 

CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTAC\GGT GGCTGCACCA 1080 

TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 
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m 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG 
CTCCAATCGG GTAACTCCCA GGAGAGTGTC 

5 

AGCCTCAGCA GCACCCTGAC GCTGAGCAAA 
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG 
10 TGTTGAATTC AGATCCGTTA ACGGTTACCA 

GGC CGTGAT A TCTACGTATG ATCAGCCTCG 
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC 

15 

TAATAAAATG AGGAAATTGC ATCGCATTGT 
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT 
20 GCGGTGGGCT CTATGGAACC AGCTGGGGCT 

ACGTCAATGA CGGTAAATGG CCCGCCTGGC 
TTCCTACTTG GCAGTACATC TACGTATTAG 

25 

GGCAGTACAT CAATGGGCGT GGATAGCGGT 
CCATTGACGT CAATGGGAGT TTGTTTTGGC 
30 GTAACAACTC CGCCCCATTG AC GC AAATGG 

TAAGCAGAGC TGGGTACGTC CTCACATTCA 
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT 

35 

AAGGGCCCAT CGGTCTTCCC CCTGGCACCC 
GCCCTGGGCT GCCTGGTCAA GGACTACTTC 
40 GGCGCCCTGA CCAGCGGCGT GCACACCTTC 

TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC 
AACGTGAATC ACAAGCCCAG CAACACCAAG 

45 

GACAAAACTC ACACATGCCC ACCGTGCCCA 
TTCCTCTTCC CCCCAAAACC CAAGGACACC 
50 TGCGTGGTGG TGGACGTGAG CCACGAAGAC 

GGCGTGGAGG TGCATAATGC CAAGACAAAG 
CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC 

55 

TGCAAGGTCT CCAACAAAGC CCTCCCAGCC 
GGGCAGCCCC GAGAACCACA GGTGTACACC 




GCCAAAGTAC AGTGGAAGGT GGATAACGCC 12 00 

AffiAGAGCAGG ACAGCAAGGA CAGCACCTAC 1260 

GCAGACTACG AGAAACACAA AGTCTACGCC 1320 

CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1380 

ACTACCTAGA CTGGATTCGT GACAACATGC 1440 

ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1500 

CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1560 

CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1620 

TGGGAAGACA ATAGCAGGCA TGC TGGGG AT 1680 

CGACAGCTA\T GCCAAGTACG CCCCCTATTG 1740 

ATTATGCCCA GTACATGACC TTATGGGACT 1800 

TCATCGCTAtYtACCATGGTG ATGCGGTTTT 1860 

TTGACTCACG GGGATTTCCA AGTCTCCACC 1920 

ACCAAAATCA AQGGGACTTT CCAAAATGTC 1980 

GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2040 

GTGATCAGCA CTGAACACAG ACCCGTCGAC 2100 

GTCGCTGTTG CTACffiCGTGT CGCTAGCACC 2160 

TCCTCCAAGA GCACCTCTGG GGGCACAGCG 2220 

CCCGAACCGG TGACGG^GTC GTGGAACTCA 22 80 

CCGGCTGTCC TACAGTCCTC AGGACTCTAC 2340 

AGCAGCTTGG GCACCCAgVc CTACATCTGC 2400 

GTGGACAAGA AAGCAGAGCC CAAATCTTGT 2460 

GCACCTGAAC TCCTGGGGGQ ACCGTCAGTC 2520 

CTCATGATCT CCCGGACCCC VtGAGGTCACA 2580 

CCTGAGGTCA AGTTCAACTG QTACGTGGAC 2 640 

CCGCGGGAGG AGCAGTACAA CAGCACGTAC 2700 

CAGGACTGGC TGAATGGCAA GGAGTACAAG 2760 

CCCATCGAGA AAACCATCTC CAAAGCCAAA 2820 

CTGCCCCCAT CCCGGGATGA GCTGACCAAG 2880 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



AACCAGGTCA 
TGGGAGAGCA 
GACGGCTCCT 
AACGTCTTCT 
CTCTCCCTGT 
TCGTGACAAC 
GCCAGCCATC 
CCACTGTCCT 
CTATTCTGGG 
GGCATGCTGG 
CCCGATCCCC 
AATTTTAACA 
AGACAGTGTT 
GACTCCTAAG 
GCCTGATTCC 
GCAGGAGCCA 
CTGACATAGT 
CAAACTTGAC 
GTTCGACCAT 
GACCTACCCT 
TCTTCAGTGG 
CCTGAGAAGA 
GAACCACCAC 
GAACAACCGG 
TACCAGGAAG 
GAATTTGAAA 
GAATACCCAG 
GTCTACGAGA 
CTATGCATTT 



gcctgacctg 

atgggcagcc 

tcttcctct) 

catgctccgt 

ctccgggtaa 

atgcggccgt 

TGTTGTTTGC 
TTCCTAATAA 
GGGTGGGGTG 
GGATGCGGTG 
AGCTTTGCTT 
CCAATTCAGT 
CTCTGCACAG 
CCAGTGAGTG 
GTAGAGCCAC 
GGGCAGAGCA 
TGTGTTGGGA 
GGCAATCCTA 
TGAACTGCAT 
GGCCTCCGCT 
AAGGTAAACA 
ATCGACCTTT 
GAGGAGCTCA 
AATTGGCAAG 
CCATGAATCA 
GTGACACGTT 
GCGTCCTCTC 
AGAAAGACTA 
TTATAAGACC 



RGAGGATCC 



CCTGGTCAAA 
GGAGAACAAC 
CAGCAAGCTC 
IATGCATGAG 
A1 

GATATCTACG 
CCCTCCCCCG 
AATGAGC 
GGGCAGGAfcA 
GGCTCTATGC 
CTCAATTTCT 
AGTTGATTGA 
ATAAGGACAA 
GCACAGCATT 
ACCTTGGTAA 
TATAAGGTGA 
GCTTGGATAG 
GCGTGAAGGC 
CGTCGCCGTG 
CAGGAACGAG 
GAATCTGGTG 
AAAGGACAGA 
TTTTCTTGCC 
TAAAGTAGAC 
ACCAGGCCAC 
TTTCCCAGAA 
TGAGGTCCAG 
ACAGGAAGAT 
ATGGG AC TTT 



GGCTTCTATC 
TACAAGACCA 
AC CGTGGAC A 
GCTCTGCACA 
GTTAACGGTT 
TATGATCAGC 
TGCCTTCCTT 
TTGCATCGCA 

gcaaggggga 
aaccagctgg 
jatttgcata 
gcaaatgcgt 
acaotattca 
ctaggcsagaa 
gggccaaVct 

GGTAGGATi 
CTTGGACAGC 
TGGTAGGATT 
TCCCAAAATA 
TTCAAGTACT 
ATTATGGGTA 
ATTAATATAG 
AAAAGTTTGG 
ATGGTTTGGA 
CTTAGACTCT 
ATTGATTTGG 
GAGGAAAAAG 
GCTTTCAAGT 
TGCTGGCTTT 



CCAGCGACAT 
CGCCTCCCGT 
AGAGCAGGTG 
ACCACTACAC 
ACCAACTACC 
CTCGACTGTG 
GACCCTGGAA 
TTGTCTGAGT 
GGATTGGGAA 
GGCTCGACAG 
ATGAGAAAAA 
TGCCAAAAAG 
GAGGGAGTAC 
ATATGCTTGT 
GCTCACACAG 
GTTGCTCCTC 
vTCAGGGCTGC 
TTATCCCCGC 
TGQGGATTGG 
TCCAAAGAAT 
GGAAAACCTG 
TTCTCAGmG 
ATGATGCCTTZ 
TAGTCGGAGG 
TTGTGACAAG 
GGAAATATAA 
GCATCAAGTA 
TCTCTGCTCC 
AGATCAGCCT 



CGCCGTGGAG 
GCTGGACTCC 
GCAGCAGGGG 
GCAGAAGAGC 
TAGACTGGAT 
CCTTCTAGTT 
GGTGCCACTC 
AGGTGTCATT 
GACAATAGCA 
CGCTGGATCT 
AAGGAAAATT 
GATGCTTTAG 
CCAGAGCTGA 
CATCACCGAA 
GATAGAGAGG 
ACATTTGCTT 
GATTTCGCGC 
TGCCATCATG 
CAAGAACGGA 
GACCACAACC 
GTTCTCCATT 

agaactcaaa 
aagacttatt 
:agttctgtt 
gajtcatgcag 
acttctccca 
taagtsttgaa 
cctcct^aag 
cgactgtgsc 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
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TTCTAGTTGC CAGKCATCTG TTGTTTGCCC 
TGCCACTCCC ACTGTCCTTT CCTAATAAAA 

5 \ 

GTGTCATTCT ATTCTGGGGG GTGGGGTGGG 
CAATAGCAGG CATGCTGGGG ATGCGGTGGG 

10 TAGCTTTGCT TCTCAATTTcX TTATTTGCAT 

ACCAATTCAG TAGTTGATTG AGCAAATGCG 
TCTCTGCACA GATAAGGACA AAGATTATTC 
GCCAGTGAGT GGCACAGCAT TCTASGGAGA 
CGTAGAGCCA CACCTTGGTA AGGGCCAATC 

20 AGGGCAGAGC ATATAAGGTG AGGTAGGAXTC 

TTGTGTTGGG AGCTTGGATC GATCCTCTAT^ 
TCCGGCCGCT TGGGTGGAGA GGCTATTCGG 

25 

CTCTGATGCC GCCGTGTTCC GGCTGTCAGC 
CGACCTGTCC GGTGCCCTGA ATGAACTGCA 
30 CACGACGGGC GTTCCTTGCG CAGCTGTGCT 

GCTGCTATTG GGCGAAGTGC CGGGGCAGGA 
GAAAGTATCC ATCATGGCTG ATGCAATGCG 

35 

CCCATTCGAC CACCAAGCGA AACATCGCAT 
TCTTGTCGAT CAGGATGATC TGGACGAAGA 
40 CGCCAGGCTC AAGGCGCGCA TGCCCGACGG 

CTGCTTGCCG AATATCATGG TGGAAAATGG 
GCTGGGTGTG GCGGACCGCT ATCAGGACAT 

45 

GCTTGGCGGC GAATGGGCTG ACCGCTTCCT 
GCAGCGCATC GCCTTCTATC GCCTTCTTGA 
50 GAAATGACCG ACCAAGCGAC GCCCAACCTG 

TTCTATGAAA GGTTGGGCTT CGGAATCGTT 
CGCGGGGATC TCATGCTGGA GTTCTTCGCC 

55 

GGTTACAAAT AAAGCAATAG CATCACAAAT 
TCTAGTTGTG GTTTGTCCAA ACTCATCAAT 




CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 4 680 

TGAGGAAATT GCATCGCATT GTCTGAGTAG 4740 

GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4800 

CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4860 

AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4920 

TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4980 

AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040 

AATATGC TTG TCATCACCGA AGCCTGATTC 5100 

TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 

AGTTGCTCCT CACATTTGCT TCTGACATAG 5220 

GGTTGAACAA GATGGATTGC ACGCAGGTTC 52 80 

(STATGACTGG GCACAACAGA CAATCGGCTG 5340 

GcVgGGGCGC CCGGTTCTTT TTGTCAAGAC 5400 

GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 5460 

CGACGTSTGTC ACTGAAGCGG GAAGGGACTG 5520 

TCTCCTgVcA TCTCACCTTG CTCCTGCCGA 5580 

GCGGCTGCAV ACGCTTGATC CGGCTACCTG 5640 

CGAGCGAGCaVgTACTCGGA TGGAAGCCGG 5700 

GCATCAGGGG cV:GCGCCAG CCGAACTGTT 5760 

CGAGGATCTC GTCSTGACCC ATGGCGATGC 5820 

CCGCTTTTCT GGATTCATCG ACTGTGGCCG 5880 

AGCGTTGGCT ACCCGTQATA TTGCTGAAGA 5940 

CGTGCTTTAC GGTATCGCOG CTCCCGATTC 6000 

CGAGTTCTTC TGAGCGGGAcVrCTGGGGTTC 6060 

CCATCACGAG ATTTCGATTC cVcCGCCGCC 6120 

TTCCGGGACG CCGGCTGGAT GATCCTCCAG 6180 

CACCCCAACT TGTTTATTGC AGCtWtAAT 6240 

TTCACAAATA AAGCATTTTT TTCACTiECAT 6300 

CTATCTTATC ATGTCTGGAT CGCGGCCGCG 6360 
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ATCCCGTCGA GAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC 6420 

CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 6480 
5 \ 

AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA 6540 

ACCTGTCGTG CCAGCTGCAT TAAT^AATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 6 600 

10 TTGGGCGCTC TTCCGCTTCC TCGCTOACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 6 660 

GAGCGGTATC AGCTCACTCA AAGGCGgVaA TACGGTTATC CACAGAATCA GGGGATAACG 6720 



15 



CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGC C AG GAACCGTAAA AAGGCCGCGT 6780 

TGCTGGCGTT TTTCCATAGG CTCCGCCCCC^CTGACGAGCA TCACAAAAAT CGACGCTCAA 6840 

GTCAGAGGTG GCGAAACCCG ACAGGACTAT ^AAAGATACCA GGCGTTTCCC CCTGGAAGCT 6900 

20 CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 6960 

CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 7020 

TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 7080 
25 \ 

TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 7140 

CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGC T AC A GAGTTCTTGA 72 00 

\^ 

30 AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 72 60 

\ 

AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC\CGGCAAACAA ACCACCGCTG 7320 

GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AG ATT AC GCG OAGAAAAAAA GGATCTCAAG 7380 

\ 
\ 

AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAi^CGAAAAC TCACGTTAAG 7440 

GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT 7500 

40 GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAAC TTG GTCTGACAGT TACCAATGCT 7560 

TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCGATA GTTGCCTGAC 7620 

TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 7 680 
45 \ 

TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAE CAGCCAGCCG 7740 

GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG \TCTATTAATT 7800 

50 GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC G^TGTTGCCA 7860 

TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGQTCCGGTT 7920 



35 



55 



30TC 



CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7980 
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 8040 
CAGCACTGCA TAATTCTCTT AC TGTC ATGC CATCCGTAAG ATGCTTTTCT GTGAC TQGTG 8100 
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10 



15 



25 



30 



(h' 35 



40 



45 



50 



55 



AGTACTCAAC 
CGTCAATACG 
AACGTTCTTC 
AACCCACTCG 
GAGCAAAAAC 
GAATACTCAT 
TGAGCGGATA 
TTCCCCGAAA 



i C AAGTC ATTC 

5ataatacc 
gggx3cgaaaa 
tgcaVccaac 
aggaaggcaa 

ACTCTTCGTT 
CATATTTG 
AGTGCCACCT 



TGAGAATAGT 
GCGCCACATA 
CTCTCAAGGA 
TGATCTTCAG 
AATGCCGCAA 
TTTCAATATT 
TGTATTTAGA 



p 20 (3) INFORMATION FOR SEQ. ID. NO<: 2: 



(i) 




GTATGCGGCG 
GCAGAACTTT 
TCTTACCGCT 
CATCTTTTAC 
AAAAGGGAAT 
ATTGAAGCAT 
AAAATAAACA 



ACCGAGTTGC 
AAAAGTGCTC 
GTTGAGATCC 
TTTCACCAGC 
AAGGGCGACA 
TTATCAGGGT 
AATAGGGGTT 



TCTTGCCCGG 
ATCATTGGAA 
AGTTCGATGT 
GTTTCTGGGT 
CGGAAATGTT 
TATTGTCTCA 
CCGCGCACAT 



SEQUENCE C 



CTERISTICS: 



(A) LENGTH: 9209 bases 

(B) TYPE: nucleicWid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 



(ii) 



MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: -yes- no 

(iv) ANTI-SENSE: no 
SEQUENCE DESCRIPTION: SEQ. ID. NO,: 2: 



(ix) 



GACGTCGCGG 
AGGCCGAGGC 
GGAGAATGGG 
ATGGTTGCTG 
GACTTTCCAC 
GGGGAGCCTG 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 



CCGCTCTAGG 
GGCCTCGGCC 
CGGAACTGGG 
ACTAATTGAG 
ACCTGGTTGC 
GGGACTTTCC 
AGTAATCAAT 
TTACGGTAAA 
TGACGTATGT 



CCTCCAAAAA 
TCTGCATAAA 
CGGAGTTAGG 
ATGCATGCTT 
TGACTAATTG 
ACACCCTAAC 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 



AGCCTCCTC/ 

TAAAAAAAAT 

GGCGGGATGG 

TGCATACTTC 

AGATGCATGC 

TGACACACAT 

TTAGTTCATA 

GGCTGACCGC 

ACGCCAATAG 



CTACTTCTGG 
'AGTCAGCCA 
GCGGAGTTAG 
TGCiXTGCTGG 
TTTGCATACT 
TCCACAQAAT 
GCCCATATAT 
CCAACGACCC' 
GGACTTTCCA 



AATAGCTCAG 
TGCATGGGGC 
GGGCGGG AC T 
GGAGCCTGGG 
TCTGCCTGCT 
TAATTCCCCT 
GGAGTTCCGC 
CCGCCCATTG 
?GACGTCAA 



8160 
8220 
8280 
8340 
8400 
8460 
8520 
8540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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TGGGTGGAC T ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 
TTTCCAAGTC TCCACCCOAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 
CGGTGGGAGG TCTATATAAG ^AGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 
CATCACAGAT CTCTCACTAT GGATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT 
GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT 
GCATCTCCAG GGGAGAAGGT CACAA^GACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC 
CACTGGTTCC AGCAGAAGCC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC 
CTGGCTTCTG GAGTCCCTGT TCGCTTCAOT GGCAGTGGGT CTGGGACTTC TTACTCTCTC 
ACAATCAGCA GAGTGGAGGC TGAAGATGCT G^CC ACTTATT ACTGCCAGCA GTGGACTAGT 
AACCCACCCA CGTTCGGAGG GGGGACCAAG CTOSAAATCA AACGTACGGT GGCTGCACCA 
TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 
TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAaVgTAC AGTGGAAGGT GGATAACGCC 
CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 
AGCCTCAGCA GCACCCTGAC GC TGAGC AAA GCAGACTAOG AGAAACACAA AGTCTACGCC 
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAaNaGAGCTTCAA CAGGGGAGAG 
TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 
GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 
TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTOTAT TCTGGGGGGT 
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGGA TGCTGGGGAT 
GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACgXcCCCCTATTG 
ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 
TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATQCGGTTTT 
GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTSCCACC 
CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 
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GTAACAACTC: CGCCCCATTG ACGCAAATGG 

taagcagagcVtgggtacgtc CTCACATTCA 

5 atgggttgga ggctcatctt gctcttcctt 

gtacaactgc agcagcctgg ggctgagctg 

tgcaaggctt ctggc'flacac atttaccagt 
10 \ 

GGTCGGGGCC TGGAATGQAT TGGAGCTATT 
CAGAAGTTCA AAGGC AAGGOv C AC ATTGACT 
15 CAGCTCAGCA GCCTGACATC TGAGGACTCT 

TACGGCGGTG ACTGGTACTT CAATGTCTGG 
GCTAGC AC CA AGGGCCCATC GGTCTTCCCC 

20 \ 

GGCACAGCGG CCCTGGGCTG CCTGGTCAAG 
TGGAACTCAG GCGCCCTGAC CAGCGGCGTGl 
25 GGACTCTACT CCCTCAGCAG CGTGGTGACC 

TACATCTGCA ACGTGAATCA CAAGCCCAGC 
AAATCTTGTG AC AAAAC TC A CACATGCCCA 

30 

CCGTCAGTCT TCCTCTTCCC CCCAAAACCC 
GAGGTCACAT GCGTGGTGGT GGACGTGAGC 
35 TACGTGGACG GCGTGGAGGT GCATAATGCC 

AGCACGTACC GTGTGGTCAG CGTCCTCACC 
GAGTACAAGT GCAAGGTCTC CAACAAAGCC 

40 

AAAGCCAAAG GGCAGCCCCG AGAACCACAG 
CTGACCAAGA ACCAGGTCAG CCTGACCTGC 
45 GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG 

CTGGACTCCG ACGGCTCCTT CTTCCTCTAC 
CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG 

50 

CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA 
AGAC TGGATT CGTGACAACA TGCGGCCGTG 
55 CTTCTAGTTG CCAGCCATCT GTTGTTTGCC 

GTGCCACTCC CACTGTCCTT TCCTAATAAA 




GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2340 

GTGATCAGCA C TGAAC AC AG ACCCGTCGAC 2400 

GTCGCTGTTG CTACGCGTGT CCTGTCCCAG 2460 

GTGAAGCCTG GGGCCTCAGT GAAGATGTCC 2520 

TACAATATGC AC TGGGT AAA ACAGACACCT 2580 

TATCCCGGAA ATGGTGATAC TTCCTACAAT 2 640 

GCAGACAAAT CCTCCAGCAC AGCCTACATG 2700 

GCGGTCTATT AC TGTGC AAG ATCGACTTAC 2760 

GGCGCAGGGA CCACGGTCAC CGTCTCTGCA 2820 

CTGGCACCCT CCTCCAAGAG CACCTCTGGG 2880 

GACTACTTCC CCGAACCGGT GACGGTGTCG 2940 

CACACCTTCC CGGCTGTCCT ACAGTCCTCA 3000 

OTGCCCTCCA GCAGCTTGGG CACCCAGACC 3060 

AACACCAAGG TGGACAAGAA AGCAGAGCCC 3120 

CCGTGCCCAG CACCTGAACT CCTGGGGGGA 3180 

AAGGACACCC TCATGATCTC CCGGACCCCT 3240 

CACGAAGACta CTGAGGTCAA GTTCAACTGG 3300 

AAGACAAAGC (SGCGGGAGGA GCAGTACAAC 33 60 

GTCCTGCACC AGG^ACTGGCT GAATGGCAAG 3420 

CTCCCAGCCC CCATOGAGAA AACCATCTCC 3480 

GTGTACACCC TGCCCCOATC CCGGGATGAG 3540 

CTGGTCAAAG GCTTCTATCC CAGCGACATC 3600 

GAGAACAACT ACAAGACCAC 1GCCTCCCGTG 3660 

AGCAAGCTCA CCGTGGACAA GAlSCAGGTGG 3720 

ATGCATGAGG CTCTGCACAA CCACXACACG 37 80 

TGAGGATCCG TTAACGGTTA CCAACTACCT 3 840 

ATATCTACGT ATGATCAGCC TCGACTGTOC 3900 
CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG\ 3960 

ATGAGGAAAT TGCATCGCAT TGTCTGAGTA N4020 
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GGTGTJCATTC 
ACAATAOCAG 
GCTGGATCTSC 
AGGAAAATTA 
ATGCTTTAGA 
CAGAGCTGAG 
ATCACCGAAG 
ATAGAGAGGG 
CATTTGCTTC 
ATTTCGCGCC 
GCCATCATGG 
AAGAACGGAG 
ACCACAACCT 
TTCTCCATTC 
GAACTCAAAG 
AGACTTATTG 
AGTTCTGTTT 
ATCATGCAGG 
CTTCTCCCAG 
AAGTTTGAAG 
CTCCTAAAGC 
GACTGTGCCT 
CCTGGAAGGT 
TCTGAGTAGG 
TTGGGAAGAC 
TCGAGCTACT 
AATTTTAACA 
AGACAGTGTT 
GACTCCTAAG 



ffTCCG 



TATTCTGGGG 

GCATGCTGGG 

CCGATCCCCA 
kTTTTAACAC 

GACAGTGTTC 
ACTCICTAAGC 
CCTGA* 
CAGGAGC5AG 
TGACATAGT 
AAACTTGACG 
TTCGACCATT 
ACCTACCCTG 
CTTCAGTGGA 
CTGAGAAGAA 
AACCACCACG 
AACAACCGGA 
ACCAGGAAGC 
AATTTGAAAG 
AATACCCAGG 
TCTACGAGAA 
TATGCATTTT 
TCTAGTTGCC 
GCCACTCCCA 
TGTCATTCTA 
AATAGCAGGC 
AGCTTTGCTT 
CCAATTCAGT 
CTCTGCACAG 
CCAGTGAGTG 



GGTGGGGTGG 
GATGC GGTGG 
GCTTTGCTTC 

caattcagta 
tctgcacaga 
cagtgagtgg 
tagagccaca 
ggcagagcat 
gtgttgggag 
1caatcctag 
gaVctgcatc 

GCCTiSCGCTC 

aggtaaVcag 

tcgaccttVa. 

agg agc tc at" 

attggcaagt 

catgaatcaa 

tgacacgttt 

cgtcctctct 

gaaagactaa 

tataagacca 

agccatctgt 

ctgtcctttc 

ttctgggggg 

atgctgggga 

CTCAATTTCT 
AGTTGATTGA 
ATAAGGACAA 
GCACAGCATT 



GGCAGGACAG 
GCTCTATGGA 
TCAATTTCTT 
GTTGATTGAG 
TAAGGACAAA 
CACAGCATTC 
CCTTGGTAAG 
ATAAGGTGAG 
CTTGGATAGC 
CGTGAAGGCT 
GTCGCCGTGT 
AGGAACGAGT 
AATCTGGTGA 
AAGGACAGAA 
TTTCTTGCCA 
X3TAGACA 
CCAfSGCCACC 

ttcccVgaaa 
gaggtcc\gg 
caggaagat) 

TGGGACTTTT 
TGTTTGCCCC 
CTAATAAAAT 
TGGGGTGGGG 
TGCGGTGGGC 
TATTTGCATA 
GCAAATGCGT 
ACATTATTCA 
CTAGGGAGAA 



CAAGGGGGAG 
ACCAGCTGGG 
ATTTGCATAA 
CAAATGCGTT 
CATTATTCAG 
TAGGGAGAAA 
GGCCAATCTG 
GTAGGATCAG 
TTGGACAGCT 
GGTAGGATTT 
CCCAAAATAT 
TCAAGTACTT 
TTATGGGTAG 
TTAATATAGT 
AAAGTTTGGA 
TGGTTTGGAT 
TTAGACTCTT 
TTGATTTGGG 
AGGAAAAAGG 
CTTTCAAGTT 
1CTGGCTTTA 
TOSCCCGTGC 
GAGGAAATTG 
CAGGACAGCA 
TCTATGG? 
ATGAGAAAAA >I 
TGCCAAAAAG 
GAGGGAGTAC 
ATATGCTTGT 



gattgggaag 
gctcgacagc 
tgagaaaaaa 
gccaaaaagg 
agggagtacc 
tatgcttgtc 
ctcacacagg 
ttgctcctca 
cagggctgcg 
tatccccgct 
ggggattggc 
ccaaagaatg 
gaaaacctgg 
tctcagtaga 
tgatgcctta 
agtcggaggc 
tgtgacaagg 
gaaatataaa 
catcaagtat 
ctctgctccc 
gatcagcctc 
cttccttgac 
catcgcattg 
agggggagga 
c agc tggggc 
aaggaaaatt 
gVtgctttag 
cca!sagctga 
catca&cgaa 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
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GCCTGATTCC 

GCAGGAGCCA 

CTGACATAGT 

CGCAGGTTC: 

AATCGGCTGC 

TGTCAAGACC 

GTGGCTGGCC 

AAGGG AC TGG 

TCCTGCCGAG 

GGCTACCTGC 

GGAAGCCGGT 

CGAACTGTTC 

TGGCGATGCC 

CTGTGGCCGG 

TGCTGAAGAG 

TCCCGATTCG 

CTGGGGTTCG 

ACCGCCGCCT 

ATCCTCCAGC 

GCTTATAATG 

TCACTGCATT 

GCGGCCGCGA 

ATTGTTATCC 

GGGGTGCCTA 

AGTCGGGAAA 

GTTTGCGTAT 

GGCTGCGGCG 

GGGATAACGC 

AGGCCGCGTT 



GTAGAGCCAC 
GGGCAGAGCA 
TGTGTTGGGA 
CCGGCCGCTT 
TGATGCCG 
GACCTGTCCG 
ACGACGGGCG 
CTGCTATTGG 
AAAGTATCa 
CCATTCGACC 
CTTGTCGATC 
GCCAGGCTCA 
TGCTTGCCGA 
CTGGGTGTGG 
CTTGGCGGCG 
CAGCGCATCG 
AAATGACCGA 
TCTATGAAAG 
GCGGGGATCT 
GTTACAAATA 
CTAGTTGTGG 
TCCCGTCGAG 
GCTCACAATT 
ATGAGTGAGC 
CCTGTCGTGC 
TGGGCGCTCT 
AGCGGTATCA 
AGGAAAGAAC 
GCTGGCGTTT 



ACCTTGGTAA 
TATAAGGTGA 
GCTTGGATCG 
GGGTGGAGAG 
CCGTGTTCCG 
GTGCCCTGAA 
TTCCTTGCGC 
GCGAAGTGCC 
TCATGGCTGA 
.CCAAGCGAA 
AGSATGATCT 
AGGCiSCGCAT 
ATATCATCGT 
CGGACCGC! 
AATGGGCTGA 
CCTTCTATCG 
CCAAGCGACG 
GTTGGGCTTC 
CATGCTGGAG 
AAGCAATAGC 
TTTGTCCAAA 
AGCTTGGCGT 
CCACACAACA 
TAACTCACAT 
CAGCTGCATT 
TCCGCTTCCT 
GCTCACTCAA 
ATGTGAGCAA 
TTCCATAGGC 



GGGCCAATCT 

GGTAGGATCA 

ATCCTCTATG 

GCTATTCGGC 

GCTGTCAGCG 

TGAAC TGC AG 

AGCTGTGCTC 

GGGGCAGGAT 

TGCAATGCGG 

ACATCGCATC 

GGACGAAGAG 

GCCCGACGGC 

GGAAAATGGC 

TCAGGACATA 

^CGCTTCCTC 

COTTCTTGAC 

CCCAACCTGC 

GGAATCGTTT 

TTCTTCGCCC 

ATCACAAAT' 

CTCATCAATC 

AATCATGGTC 

TACGAGCCGG 

TAATTGCGTT 

AATGAATCGG 

CGCTCACTGA 

AGGCGGTAAT 

AAGGCCAGCA 

TCCGCCCCCC 



gc tc ac ac ag 

gttgctcctc 

gttgaacaag 

tatgactggg 

caggggcgcc 

gacgaggcag 

gacgttgtca 

ctcctgtcat 

cggc tgc at a 

gagcgagcac 

catcaggggc 

gaggatctcg 

cgcttttctg 

gcgttggcta 

gtgctttacg 

gagttcttct 

catcacgaga 

tccgggacgc 

accccaactt 

tcacaaataa 

tatcttatca 

atasctgttt 

aagca\aaag 

gcgctcaotg 

ccaacgcgc 

ctcgctgcgc 

acggttatcc 

aaaggccagg 

tgacgagcat 



GATAGAGAGG 
ACATTTGCTT 
ATGGATTGCA 
CACAACAGAC 
CGGTTCTTTT 
CGCGGCTATC 
CTGAAGCGGG 
CTCACCTTGC 
CGCTTGATCC 
GTACTCGGAT 
TCGCGCCAGC 
TCGTGACCCA 
GATTCATCGA 
CCCGTGATAT 
GTATCGCCGC 
GAGCGGGACT 
TTTCGATTCC 
CGGCTGGATG 
GTTTATTGCA 
AGCATTTTTT 
TGTCTGGATC 
CCTGTGTGAA 
TGTAAAGCCT 
CCCGCTTTCC 
GGGAGAGGCG 
RCGGTCGTTC 
ACAGAATCAG 
AACCQTAAAA 
CACAAAAATC 



5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
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GAOSCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC 7560 

CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG 7 620 

5 CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG TATCTCAGTT 7 680 

\ 

CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC 7740 

GCTGCGCCTtXaTCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC 7 800 

10 \ 

CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG 7860 

\ 

\ 

AGTTCTTGAA GTGGTGGCCT AACTACGGCT AC AC T AG AAG GACAGTATTT GGTATCTGCG 7920 

15 CTCTGCTGAA GCCAGrrACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA 7980 

CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG 8040 

GATCTCAAGA AGATCCTT^G ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT 8100 

CACGTTAAGG GATTTTGGTC^ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA 8160 

ATTAAAAATG AAGTTTTAAA TCAATC T AAA GTATATATGA GTAAACTTGG TCTGACAGTT 8220 

25 ACCAATGCTT AATCAGTGAG GCAOCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG 82 80 

\ 

TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA 8340 

GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC 8400 

30 \ 

AGCCAGCCGG AAGGGCCGAG CGCAGAAGTQ^ GTCCTGCAAC TTTATCCGCC TCCATCCAGT 8460 

CTATTAATTG TTGCCGGGAA GCTAGAGTAA i^TAGTTCGCC AGTTAATAGT TTGCGCAACG 8520 

35 TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA 8580 

GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG 8640 

TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGT^AGTT GGCCGCAGTG TTATCACTCA 8700 

40 \ 

TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGC^C ATCCGTAAGA TGCTTTTCTG 87 60 

TG AC TGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTGx TATGCGGCGA CCGAGTTGCT 8820 

45 CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG OAGAACTTTA AAAGTGC TC A 8880 

TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTHACCGCTG TTGAGATCCA 8940 



50 



GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG 9000 

TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 9060 

GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCaVt TATCAGGGTT 9120 

55 ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAAV ATAGGGGTTC 9180 

CGCGCACATT TCCCCGAAAA GTGCCACCT \ 9209 
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°^ (4) INFORMATION FOR SEQ. ID. N0|: 3: 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 54 bases 

(Bl) TYPE: nucleic acid 

(Cj) STRANDEDNESS: single 

(D]j TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

0- (iii) HYPOTHETICAL: yea- no 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ. ID. NOj: 3: 

20 



30 



ATC ACA | GAT CTC TCA CCA TGG ATT TTC AGG TBC AGA TTA TCA GCT 

TC a— 



(5) INFORMATION FOR SEQ. ID. NO f : 4: 
(i) SEQUENCE CHARACTERISTICS: 

25 r 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

0^ (iii) HYPOTHETICAL: yes- no 

35 (iv) ANTI-SENSE: yes 

^ (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO,: 4: 

40 -5^" TGC AGC ATC CGT ACG TTT GAT TTC CAG CTT "3"*- 

^ (6) INFORMATIONFORSEQ.ID.NO,: 5: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 
5 (iv) ANTI^SENSE: no 

Qu/ (ix) SEQUENCE DESCRIPTION: SEQ. ID. NO ( : 5: 



1 



ATG 


GAT 


TTT 


CAG 


GTG 


CAG 


ATT 


ATC 


AGC 


TTC 


CTG 


CTA 


ATC 


AGT 


GCT 


TCA 


GTC 


51 


ATA 


ATG 


TCC 


AGA 


\ 

GGq 


CAA 


ATT 


GTT 


CTC 


TCC 


CAG 


TCT 


CCA 


GCA 


ATC 


CTG 


TCT 


102 


GCA 


TCT 


CCA 


GGG 


GAG 


iAAG 


GTC 


ACA 


ATG 


ACT 


TGC 


AGG 


GCC 


AGC 


TCA 


AGT 


GTA 


153 


AGT 


TAC 


ATC 


CAC 


TGG 


TTC 


CAG 


CAG 


AAG 


CCA 


GGA 


TCC 


TCC 


CCC 


AAA 


CCC 


TGG 


204 


ATT 


TAT 


GCC 


ACA 


TCC 


AAC 


CTG 


GCT 


TCT 


GGA 


GTC 


CCT 


GTT 


CGC 


TTC 


AGT 


GGC 


255 


AGT 


GGG 


TCT 


GGG 


ACT 


TC'^ 


TAC 


TCT 


CTC 


ACA 


ATC 


AGC 


AGA 


GTG 


GAG 


GCT 


GAA 


306 


GAT 


GCT 


GCC 


ACT 


TAT 


TAQ 


TGC 


CAG 


CAG 


TGG 


ACT 


AGT 


AAC 


CCA 


CCC 


ACG 


TTC 


357 


GGA 


GGG 


GGG 


ACC 


AAG 


CTG 


\gaa 


ATC 


AAA 


















384 



^ (7) INFORMATION FOR ^EQ. ID. NO ( : 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: nuclfeicacid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY:\ linear 



(ii) MOLECULE TYPE: \DNA (genomic) 

(iii) HYPOTHETICAL: ye\ 
40 (iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ. ID. NO.: 6: 

Qy 45 *r GCG GCT CCC ACG cgt GTC CTG\ TCC CAG -3"^ 27 
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(8) INFORMATION FOR SEQ. ID. NO<: 7: 




(i) \SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 
(B\ TYPE: nucleic acid 

(C) \ STRANDEDNESS: single 

(D) \ TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: yes 

(ix) SEQUENCe\)ESCRIPTION: SEQ. ID. NO,: 7: 

5' GG(G/C) TGT TGT GC^T ACC TC{A/C) — (A/C)GA,GAC ( G / A ) GT CA 3 



0-2- 

INFORMATION I 



?or sib 



Q. ID. NOj: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 Bases 

(B) TYPE: nucleic ackl 

(C) STRANDEDNESS\ single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 
(ix) SEQUENCE DESCRIPTION: SEQ, ID. NO,: 8: 



29 



40 



45 



50 



ATG 


GGT 


TGG 


AGC 


CTC 


ATC 


TTG 


CTC 


TTC 


CTT 


GTC 


GCT\ 


GTT 


GCT 


ACG 


CGT 


GTC 


51 


CTG 


TCC 


CAG 


GTA 


CAA 


CTG 


CAG 


CAG 


CCT 


GGG 


GCT 


GAG 


OTG 


GTG 


AAG 


CCT 


GGG 


102 


GCC 


TCA 


GTG 


AAG 


ATG 


TCC 


TGC 


AAG 


GCT 


TCT 


GGC 


TAC 


AcV 


TTT 


ACC 


AGT 


TAC 


153 


AAT 


ATG 


CAC 


TGG 


GTA 


AAA 


CAG 


ACA 


CCT 


GGT 


CGG 


GGC 


CTG 


Vaa 


TGG 


ATT 


GGA 


204 


GCT 


ATT 


TAT 


CCC 


GGA 


AAT 


GGT 


GAT 


ACT 


TCC 


TAC 


AAT 


CAG 




TTC 


AAA 


GGC 


255 


AAG 


GCC 


ACA 


TTG 


ACT 


GCA 


GAC 


AAA 


TCC 


TCC 


AGC 


ACA 


GCC 


TAC 


\ATG 


CAG 


CTC 


306 


AGC 


AGC 


CTG 


ACA 


TCT 


GAG 


GAC 


TCT 


GCG 


GTC 


TAT 


TAC 


TGT 


GCA 


ASA 


TCG 


ACT 


357 



-52- 



